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ABSTRACT 

The  bootstrap  method  of  inference  is  extended  to  stratified  cluster 
samples  when  the  parameter  of  interest,  0,  is  a  nonlinear  function  g(Y)  of 
the  population  mean  vector  Y.  The  bootstrap  estimate  of  bias  of  0  *  g(y) 

a 

and  the  estimate  of  variance  of  0  are  obtained,  where  y  Is  a  design- 
unbiased  linear  estimator  of  Y.  Bootstrap  confidence  intervals  for  9  are 
also  given.  Asymptotic  justifications  are  provided. 


AMS  (MOS)  Subject  Classifications:  62D05,  62G15 

Key  Words:  balanced  or  random  half-samples,  bootstrap,  combined  ratio, 
percentile  method,  stratified  samples. 
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SIGNIFICANCE  AND  EXPLANATION 


n/ 

Most  sample  surveys  Involve  stratification  and  multi-stage  clustered 
sampling.  A  recent  trend  in  survey  data  analysis  is  inference  about  nonlinear 
statistics  from  complex  samples.  Available  methods  include  the  linearization, 
jackknife  and  balanced  half-samples.  In  the  non-survey  context,  another 
method  called  the  bootstrap  has  been  shown  to  enjoy  other  desirable 
properties,  the  most  important  one  being  that  it  reflects  the  skewness 
inherent  in  the  original  point  estimate.  It  is  shown  that  a  straightforward 
extension  of  the  usual  bootstrap  provides  incorrect  variance  estimates  and 
misleading  confidence  intervals.  A  correct  version  is  constructed  by 
adjusting  for  a  scaling  problem  before  applying  the  nonlinear 
transformation.  Several  desirable  theoretical  properties  of  the  proposed 
method  are  described.  A  detailed  study  in  the  special  case  of  the  combined 
ratio  estimator  is  given. 

\ 


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  authors  of  this  report. 


BOOTSTRAP  INFERENCE  WITH  STRATIFIED  SAMPLES 


J.  N.  K.  Rao*  and  C.  F.  J.  Wu 


1.  INTRODUCTION 


Resampling  methods,  including  the  jackknife  and  the  bootstrap, 
provide  standard  error  estimates  and  confidence  intervals  for  the  para¬ 
meters  of  interest.  These  methods  are  simple  and  straightforward  but  are 
computer-intensive,  especially  the  bootstrap.  Efron  (1982)  has  given 
an  excellent  account  of  resanpling  methods  in  the  case  of  an  independent 
and  identically  distributed  (i.i.d.)  sample  of  fixed  size  n  from  an 
unknown  distribution  F  ,  and  the  parameter  of  interest  0  *•  0(F).  The 
bootstrap  confidence  intervals  for  0  take  account  of  the  skewness  in 
the  estimator  %  ■  0(F)  ,  unlike  the  symmetric  jackknife  intervals  based 
on  the  Student's  t  or  the  normal  approximation.  Moreover,  limited 
empirical  evidence  (see  Efron,  1982,  p.18)  has  indicated  that  the  bootstrap 
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standard  error  estimates  are  likely  to  be  more  stable  than  those  based  on  tr.e 
jackknife  and  also  less  biased  than  those  based  on  the  customary  delta 
(linearization)  method.  Holt  and  Scott  (1983)  applied  the  bootstrap  to 
estimate  variances  of  the  regression  estimators  from  data  obtained 
from  cluster  sanpling  without  stratification. 

The  main  purpose  of  this  article  is  to  propose  an  extension  of  the 

bootstrap  method  to  stratified  samples,  in  the  context  of  sample  survey 

data;  especially  to  data  obtained  from  stratified  cluster  samples  involving 

large  numbers  of  strata,  L  ,  with  relatively  few  primary  sampling  units 

(psu's)  sampled  within  each  stratum.  For  nonlinear  statistics  6  that 

can  be  eiqpressed  as  functions  of  estimated  means  of  p  (_>  1)  variables, 

Krewski  and  Rao  (1981)  established  the  asymptotic  consistency  of  the 

variance  estimators  from  the  jackknife,  the  delta  and  the  balanced 

repeated  replication  (BKR)  methods  as  L  «®  within  the  context  of  a 

sequence  of  finite  populations  {II  }  with  L  strata  in  II  .  Their 

L  1 1 

result  is  valid  for  any  multistage  design  in  which  the  psu’s  are  selected 
with  replacement  and  in  which  independent  subsamples  are  selected 
within  those  psu's  sampled  more  than  once.  Rao  and  Hu  (1983) 
obtained  second  order  asymptotic  expansions  of  these  variance 
estimators  under  the  above  set  up  and  made  comparisons  in  terms 
of  their  biases. 

The  proposed  bootstrap  method  for  stratified  samples  is  described 
in  Section  2  and  the  properties  of  the  resulting  variance  estimator  are 
studied.  The  bootstrap  estimate  of  bias  of  §  is  also  obtained.  Section 
3  provides  bootstrap  confidence  intervals  for  8  .  The  special  case  of  a 


ratio  8  -  Y/X  is  investigated  in  Section  4,  where  Y  and  X  are  the 
population  means  of  variables  y  and  x  respectively.  Finally,  the 
results  are  extended  to  stratified  simple  random  sampling  without 
replacement  in  Section  S. 

2.  THE  BOOTSTRAP  METHOD 

The  parameter  of  interest  8  is  a  nonlinear  function  of  the 
population  mean  vector  Y  ■  (Y^,...,Y^)  ,  say  8  *  g(Y) .  This  form  of 
0  includes  ratios,  regression  and  correlation  coefficients.  If  n^  (>_  2) 
psu's  are  selected  with  replacement  with  probabilities  p^  in  stratum  h 

A 

then  Krewski  and  Rao  (1981)  have  shown  that  the  natural  estimator  6  =  g(Y 
can  be  expressed  as  8  **  g(y)  •  Here  Y  is  a  design-unbiased  linear 
estimator  of  Y  -  and  y  *  where  wh  and  yh  m  (\l"  *  ’  '*hp 

are  the  h-th  stratum  weight  (Z*h  -  1)  and  population  mean  vector 
respectively  and  y^  is  the  mean  of  n^  i.i.d.  random  vectors  y^  = 

(yhil'***'yhip)T  for  each  h  With  E(yhi)  *  V  For  h  *  h’  '  yhi  and 

y  ,  are  independent  but  not  necessarily  identically  distributed, 
h  j 

2.1  The  Naive  Bootstrap 

In  the  case  of  an  i.i.d.  saaple  {y±}"  with  E(yi)  -  Y  ,  the 
bootstrap  method  is  as  follows:  (i)  Draw  a  sinple  random  sample  {y*}" 
with  replacement  from  the  observed  values  Yi'Y2'”"yn  and  calculate 
0*  .  g(y*)  where  y*  -  Ey*/n.  (ii)  Independently  replicate  step  (i) 
a  large  nunfcer,  B  ,  of  times  and  calculate  the  corresponding  estimates 
0*1,...,  §*B.  (iii)  Yhe  bootstrap  variance  estimator  of  §  ■  g(y)  is 


given  by 


(2.1) 


V*>  ■  pi  !  <8*b  -  §:> 


;*.2 


where  0*  “  I@*b/B.  Hie  Monte-Carlo  estimator  v  (a)  is  an  approximation 


to 


ft*.  2 


v  *  var4(§*)  *  E  (0*  -  E  0*) 

b  *  »  • 


(2.2) 


where  E*  denotes  the  expectation  with  respect  to  bootstrap  sampling 


from  a  given  sanple  y,,...,y  .  No  closed- form  expression  for  var  (8  ) 

X  n  * 

generally  exists  in  the  nonlinear  case,  but  in  the  linear  case  with  p*l, 
8*  «  y*  and  v  reduces  to 


var^(y*)  =  s2  ■  var(y) 
n2  n 


(2.3) 


2  -  2  -  2 

where  (n-l)s  =  £  (y^y)  and  var(y)  ■>  s  /n  is  the  unbiased  estimator 


of  variance  of  y.  Hie  modified  variance  estimator  [n/(n-l) ]var*(0*) 

exactly  equals  var(y)  in  the  linear  case,  but  Efron  (1982)  found  no 

advantage  in  this  modification.  In  any  case,  n/(n-l)  =  1  in  most 

^  £ 

applications  and  var#(0  )  is  a  consistent  estimator  of  the  variance  of 
§  ,  as  n  -*•  °°  (Bickel  and  Freedman ,  1981) .  Hie  bootstrap  histogram  of 


At  A 

0  , . . . ,0  may  be  used  to  find  confidence  intervals  for  0  that  take 


account  of  the  skewness  in  0  .  This  method  (Efron,  1982)  is  called  the 
percentile  method. 


Noting  the  i.i.d.  property  of  the  y.  . 's  within  each  stratum,  a 

111 


straightforward  extension  of  the  previous  bootstrap  method  to  stratified 


*  ”h 

samples  is  as  follows:  (i)  Take  a  simple  random  sample  {y*  }  with 

"h  1  iml 

replacement  from  the  given  sanple  {y^}  in  stratum  h  ,  independently 


for  each  stratum.  Calculate  y*  -  nj^E^y^,  *  Ew^y*  and  *  g(y*). 


i-1 


(ii)  Independently  replicate  step  (i)  a  large  number,  B,  of  times  and 
calculate  the  corresponding  estimates  6  , 9  .  (iii)  The  bootstrap 

variance  estimator  of  i  *  g(y)  is  given  by 

B 


(2.4) 


•  •  A * 

where  y  -  £w.  y,  and  0  *  E0  /B .  The  Monte-Carlo  estimator  v,  (a)  is 

n  n  a  o 

an  approximation  to 


var#(8*)  -  E#  <§*-£,§*) 2 


(2.5) 


where  ,  as  before,  denotes  the  expectation  with  respect  to  bootstrap 

sane  ling.  In  the  linear  case  with  p-1,  8  ■  Ew.  y.  ■  y  and  v, 

n  n  d 

reduces  to 

»«r.(y*>  -A  p^]s^  (2.6) 

2  -  2 

where  (n^-Ds^  ■  (y^  -  y^)  .  Comparing  (2.6)  with  the  unbiased  estimator 

-  -  2  2 

of  variance  of  y  ,  var(y)  ■  Iw^s^/n^  ,  it  immediately  follows  that 
var#(y*)/var(y)  does  not  converge  to  1  in  probability,  unless  L  is 
fixed  and  n^  •  for  each  h.  Hence,  var^(y*)  is  not  a  consistent 
estimator  of  the  variance  of  y.  It  also  follows  that  v.  is  not  a 

D 

consistent  estimator  of  the  variance  (or  mean  square  error)  of  a  general 
nonlinear  statistic.  There  does  not  seem  to  be  an  obvious  way  to  correct 
this  scaling  problem  except  when  n^  -  k  for  all  h  in  which  case 
k(k-l)  ^var^tfi*)  will  be  consistent.  Bickel  and  Freedman  (1983)  also 
noticed  the  scaling  problem,  but  they  were  mainly  interested  in  bootstrap 
confidence  intervals  in  the  linear  case  (p«l) .  They  have  established 
the  asymptotic  N(0,1)  property  of  the  distribution  of  t«(y“Y)/(var(y)]* 


-G- 


and  of  the  conditional  distribution  of  (y*  -  y)/lvar* (y*) ] *  in  stratified 


simple  random  sampling  with  replacement,  and  also  proved  that 


(1^  /n^)/var* (y*)  converges  to  1  in  probability  as  n  *  In^ 


o  .  o 

where  (n^-Ds**  *  ^(y*^  -  *h'  ’  Their  result  implies  that  one 


**  *1 

could  use  the  bootstrap  histogram  of  t  ,...,t  to  find  confidence 


intervals  for  Y  ,  where  t*b  =  (y*b  -  y) / [Zw^s*b2/n^] *  where  s* 
*2 


*b2 


is  the  value  of  s^  for  the  b-th  bootstrap  sample  (b  =  1,...,B) 
In  the  nonlinear  case,  there  does  not  seem  to  be  a  simple  way  to 


r*b 


construct  t  -values  similar  to  those  of  Bickel  and  Freedman  since 


v.  has  no  closed  form.  Moreover,  the  straightforward  extension  of 
b 


the  bootstrap  (hereafter  called  the  naive  bootstrap)  does  not  permit 


the  use  of  the  percentile  method  based  on  the  bootstrap  histogram  of 


Although  t*b  is  asymptotically  N(0,1)  in  the  linear  case, 
it  is  not  likely  to  provide  as  good  an  approximation  to  the  distribution 
of  t  as  a  statistic  whose  denominator  and  numerator  are  both  adequate 
approximations  to  their  counterparts  in  t.  Such  statistics  will  be 
proposed  in  Section  3.2.  These  are  also  applicable  to  the  nonlinear 
case. 


Recognizing  the  scaling  problem  in  a  different  context,  Efron 
(1982)  suggested  to  draw  a  bootstrap  sample  of  size  n^  -  1  instead 
of  from  stratum  h  (h  =  1,...,L).  In  Section  2.2,  we  will 

instead  propose  a  different  method  which  includes  his  suggestion 
as  a  special  case. 


-7- 


2.2.  The  Proposed  Method 

Our  method  is  as  follows:  (i)  Draw  a  simple  random  sample 

{y.  . }  of  size  m.  with  replacement  from  {y.  . }  .  Calculate 

i-1  n  ni  i«i 

?hi  ‘  5h  *  -  V 


?h  '  f: 


™h 

?hi " 


(2.7) 


y  -  I«hyh  ,  6  *  g(y)  . 


(ii)  Independently  replicate  step  (i)  a  large  nuntoer,  B  ,  of  times  and 

calculate  the  corresponding  estimates  8  , . . . ,  0  .  (iii)  Hie  bootstrap 

estimator  E  (8)  of  0  can  be  approximated  by  0  ■  lE^/B.  the  boot- 

a 

strap  variance  estimator  of  0  is  given  by 


^  -  vb  -  var# ( 0)  =  E^(0  -  E#8)2 


(2.8) 


with  its  Monte-Carlo  approximation 


8b(*’  ■  V*’  -sh- J.(9b  -  V2  • 

b=l 


(2.9) 


One  can  replace  E^0  in  (2.8)  by 


2. 3  Justification  of  the  Method 


In  the  linear  case,  0  »  Y  ,  v^  reduces  to  the  customary  unbiased 


variance  estimator  var(y)  ,  noting  that 

•  1  "hV'-h 


E,<y-y)2  -  Iw2  {-L  '  -2l  -  -  -2  2 

h  V1  \  l  "h 


In  the  nonlinear  case,  we  have  shown  in  Appendix  1  that 
\  m  VL  +  0p(n"2)  , 

under  the  condition  (A.l)  given  there. 


(2.10) 


(2.11) 


where  v  is  the  linearization  variance  estimator: 

J_iVy,Vy»n  -shlkl  - 

where  g..(t)  -  3g(t)/3t^  with  t  =  (tlf...,tp)T  and  t\“1)shjk  * 
Ei(yhij"  V<yhik  '  W  With  yhj  =  Eiyhij/nh-  to  the  linear  case 


(p^l),  v  reduces  to  var(y) 

it 


Under  reasonable  regularity 


conditions  (see  Appendix  1) ,  is  a  consistent  estimator  of  variance  of 

S  ,  Var{8).  Hence,  it  follows  from  (2.10)  that  v.  is  also  consistent 

b 

for  Var(8) . 

lhe  asynptotic  N(0,1)  property  of  the  conditional  distribution 
of  (9  -  0)/5b  can  be  established,  assuming  that  0  <  6^  <_  m^/fn^-l)  < 

^2  <  ®  f°r  all  h  ,  i.e.  the  bootstrap  sample  size  m^  should  be 
coup  arable  to  the  original  sample  size  n^  in  each  stratum.  The  proof 
is  omitted  since  it  follows  along  the  lines  of  Bidcel  and  Freedman  (1983). 


This  result  provides  an  asymptotic  justification  for  the  percentile  method 
based  on  the  bootstrap  histogram  of  e1,...,!®  (Section  3.1  provides  details 
of  the  percentile  method)  . 


where 


2.4.  Estimate  of  Bias  of  § 

Rao  and  Wu  (1983)  have  shown  that  the  bias  of  6  is 

1  p  L  W2 

B(0)  *  E(9)  -  6  =  —  y  g ,,  (Y)  y  —  S...+ lower  order  terms 

2  j,k-l  jk  h-1  "h 

g  (Y)  is  the  second  derivative  32g(t)/3t  3t,  evaluated  at 
jJt  3  K 


(2. 


12) 


-9- 


t-y  and  shjk  -  E(yhi.  -  Vhjl,yhi)I  -  vhkl  .  Our  bootstrap  estimate 


of  bias  is 


B ( 6)  *  E^(0)  -  0 


(2.13) 


which  is  approximated  by  0  -  0.  Prom  (A. 2)  and  (A. 3)  of  Appendix  1 

A 


we  have 


K0)  *  E*(0)-e  «  ~  ?  g,v(y)  l 
j,k«l  3  h=l 


\\  Sh^  +0p(n-2)  . 


(2.14) 


Hence,  B(@)  is  a  consistent  estimator  of  B(§)  .  On  the  other 

hand,  the  bias  estimate  B(0)  *  E#(@*)  -  0  ,  based  on  the  naive  bootstrap. 


is  equal  to 


1  p  b  w  n  -i 

B(0)  *  —  I  g.v(y)  £  (-JT — )s  +  lower  order  terms  (2.15) 

3,k=l  3*  h«=l  \  n  n3K 

which  is  not  a  consistent  estimator  of  B(§).  The  proof  of  (2.15)  is 
omitted  since  it  follows  along  the  lines  of  the  proof  of  (2.14). 


2.5.  Choice  of  m^ 

The  choice  m^  =  n^  is  a  natural  one.  The  choice  m^  =  n^-1 

gives  y  ^  *  y^\  ,  and  our  method  reduces  to  the  naive  bootstrap,  except 

that,  in  step  (i)  of  the  latter  method  a  simple  random  sample  of  size  n  -1 

* 

is  selected  from  {y^}  in  stratum  h.  However,  for  n^  small  it 

i=l 

may  not  lead  to  stable  variance  estimators.  For  n^=>2,  m^*l,  the 

method  reduces  to  the  well-known  random  half-saiqple  replication  and  the 
resulting  variance  estimators  are  less  stable  them  those  obtained  from 
BBR  for  the  same  number,  B  ,  of  half-sanples  (McCarthy,  1969) .  For 


V!  SR*-  -  , 
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V 


the  same  number  of  pseudoreplications  the  bootstrap  will  in  general  perform 
less  stably  than  the  BRR,  when  the  latter  method  is  applicable.  For  &  4, 
the  choice  m^  =  n^-1  may  be  attractive  since  the  variance  estimators  are 
likely  to  be  more  stable  and  since  the  naive  bootstrap  is 
being  used.  For  small  n^,  it  may  be  worth  considering  a  bootstrap  sample 
sire  slightly  larger  than  n^,  say  between  n^+1  and  2n^. 

3 .  CONFIDENCE  INTERVALS 

We  now  consider  different  bootstrap  methods  for  setting  confidence 
intervals  for  0  . 


3.1.  Percentile  Method 

For  ready  reference,  we  now  give  a  brief  account  of  the  percentile 
method  based  on  the  bootstrap  histogram  of  0  , ...»  8  .  Define  the 
cumulative  bootstrap  distribution  function  as 


^D^(t)  =  #{0b  <  t  ;  b  «  1,...,b}/B  .  (3.1) 

For  a  <  0.5  ,  define  9  ia.)  =  CDF  L(a)  and  0  (a)  =  CDF  ^(l-a)  . 

IwW  UP 

Then  the  interval 

*§L0W(a)'  §UP(Ct)f  (3'2) 

is  am  approximate  (1  -  2a) -level  confidence  interval  for  8  .  It  has 
the  central  1  -  2a  portion  of  the  bootstrap  distribution  (Efron  (1982), 
p.78).  One  can  also  consider  a  bias-corrected  percentile  method, 
following  Efron  (1982,  p.82).  This  method  leads  to 


{6df"1(<J'(2z0  -  zq))  ,  5dF_1(4>(2z0 +za))}  (3.3) 

as  an  approximate  (1- 2a) -level  confidence  interval  for  8  ,  where  $>  is 
the  cumulative  distribution  function  of  a  standard  normal, 

■ 
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z  *  4>-1  (CDF(@) )  and  z  *  4>  *(l-a).  The  advantage  of  the  interval 

o  a 

(3.3)  over  (3.2)  has  been  demonstrated  by  Efron  (1982). 


3.2.  Bootstrap  t-statistics 


Instead  of  approximating  the  distribution  of  6  by  the  bootstrap 

distribution  of  0  ,  we  can  approximate  the  distribution  of  the  t-statistic 

t  -  (§~0)/5.  by  its  bootstrap  counterpart  t*  ■  (0  -  §)/o*(a)  where 
D  D 

♦2  a 

a  (a)  =  v  (a)  is  the  bootstrap  variance  estimator  obtained  from  (2.9) 
b  b 

by  bootstrapping  the  particular  bootstrap  sample  {y^}  i.e.  by  replacing 
y  yhi  in  the  proposed  method.  For  the  second  phase  bootstrapping 

one  could  choose  values  (n£,  b')  different  from  (m^B) .  Ibis  double¬ 
bootstrap  method  thus  leads  to  B  values  t*1,...^*3  of  t*. 

*1  *B 

Utilizing  the  bootstrap  histogram  of  t  ,  ...,t  ,  we  define 


CDFt(x)  =  #{t*b  <  x}/B  ,  t^  -  CDF~A(a),  t^  -  CDF-i(l-a)  ,  and 
construct  an  approximate  (1- 2a) -level  confidence  interval  for  0  given 
by 


•-1, 


-1. 


-  vb  '  6  -  wU  • 


LOW  bJ 


(3.4) 


The  interval  based  on  the  t-statistic  is  likely  to  be  better  than  the 
interval  based  on  the  percentile  method  (Babu  and  Singh,  1983) . 

We  now  provide  an  asymptotic  justification  for  t*.  Noting  that 
v*(a)  is  a  Monte  Carlo  approximation  to 

D 

5b2  "  \  "  E**®*)2  (3-5> 

where  0*  is  the  value  of  6  obtained  from  bootstrapping  the  particular 
saitple  {yhi}  803  the  8econ<i  phase  bootstrap  expectation,  we 


J 


can  write  t*  =  (0  -  @)/6*.  In  the  linear  case  9  «  Y  it  is  easily 

o 

seen  that 

E*5b2  “  5b  •  (3-6> 

In  the  nonlinear  case,  following  Bickel  and  Freedman  (1983),  we  can  show 
~*2  -2 

that  a.  /a.  converges  to  1  in  probability  as  n  00  .  Hence,  it 

O  D 

follows  that  the  conditional  distribution  of  t*  is  asynptotically  N(0,1). 

One  could  use  a  jackknife  t-statistic  t,  “  (§  -  0)/6  instead  of 
t  ,  where  8 ^  *>  v  is  a  jackknife  variance  estimator  of  §  (see  Krewski 

J  J 

and  Rao,  1981) .  The  corresponding  confidence  interval  is  then  given  by 

{8  -  t^,  8  -  (3.7» 

where  t _ _  and  t__  are  the  lower  and  upper  a-points  of  the  statistic 

LOW  UP 

t*  -  (9-0)/o*  obtained  from  the  bootstrap  histogram  of  t* 1 , ,  t*B  , 

•J  iJ  J  J 

♦2  2 

and  Q  is  obtained  from  8  by  jackknifing  the  particular  bootstrap 
J  J 

sample  {y.,}.  It  can  be  shown  that  the  confidence  interval  (3.7)  is 
ni 

also  asynptotically  correct.  A  confidence  interval  of  this  type  was 

considered  by  Efron  (1981)  in  the  case  of  an  i.i.d.  simple  (y^K 

'•l 

It  is  also  possible  to  replace  0  by  the  BRR  or  the  linearization 

%J 

variance  estimator  and  obtain  a  confidence  interval  similar  to  (3.7). 

4.  COMBINED  RATIO  ESTIMATOR 

The  combined  ratio  estimator  of  the  ratio  0  -  g(Y,  X)  *  Y/X  * 

R  (say)  is  given  by  8  *  g(y,x)  *■  y/x  «=  r  (say)  where  y  ■  I W,  y.  and 

n  n 

x  *  Zw^x^.  The  corresponding  bootstrap  estimator  is  §  ■  g(y,x)  “  y/x 
-  r  (say) ,  where 
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y  -  y 


5  +  Ewh^r  <5h'V  -  y +  Ay< 


and 


(4.1) 


the  bootstrap  estimator  of  variance  of  r  is  given  by 


vfa  -  E,(r  -  r) 


vL-r3E 


x  n^/m^Cn^-l)  e  xh 


(4.2) 


+  O  (n-3)  , 

P 

assuming  that  max  1^/1^  *  3)  and  0  <  6^  <,  ^/(n^-l)  _<  62  <  •  for 

all  h  (see  Appendix  2) .  Here  v  is  the  linearization  variance  estimator 

Xj 


and 


1  _  h  2 

v.  »  — r  I  —  s^. 

Jj  -2  n^  eh 


(nh"1,S-2  “  ®hi(xhi"V'  *hi  “  yhi‘yh  _r(xhi"*h)  ' 

e  xh  i«l 

2  2  a 

and  8^.  ,  s  ^  ,  Sa.^  are  respectively  the  sanple  variance  of  ehi  , 


(4.3) 


th 


eh  '  *xh  '  *exh 

x^  and  the  sanple  covariance  of  ehi  and  x^  in  the  h  stratum. 

second  term  of  (4.2)  is  zero  if 


Since  s  *0  for  n.  *  2, 
e^xh 

n^  ■  2  for  all  h.  Since  th 
-2 

order  O  (n  )  ,  we  have  v  ■ 

p  b 


-m  of  (4.2)  is  positive  and  of 
-2 

)  in  general.  On  the  other 

-3. 


hand,  the  jackknife  variance  estimator  satisfies  v_  -  v_  +  0  (n  )  in 

w  Xj  P 


the  special  case  of  n  *2  for  all  h  (Rao  and  Wu  1983).  Hie  jackknife 

n  9 

is  too  close  to  v  in  the  latter  case, 

L 

To  obtain  the  bias  of  ,  note  that,  when  e^  ■  y^  -  5h  -  R(x^  -  X^) 

replaces  its  sanple  analaogue  in  (4.2),  the  only  effect  on  (4.2) 

-2.5  -3 

is  that  the  error  term  is  0^(n  )  instead  of  0^{n  )•  By  working  on 

this  modified  formula  of  (4.2)  and  noting  that  Es  ,  “  (n, -2)  s  _  /n, 

e2xh  *  e2xh  " 

where  S  _  is  the  population  analogue  of  s  -  ,  we  get  the  bias  of  v.  : 

e2xh  exh  b 

,  Wh  "l,-2 

Bias (v  )  -  Bias(v  ) - -  £  —r - r  s  _ 

*  \  fVV1^  e  ^ 

W2  w2  w2 

.  -^r(l  —  S2.)IZ  -^sf)  *-%(L-S-s  > 2  +0(n”3>  (4.3) 

X4  “h  eh  "h  xh  X4  "h  “h 


»  Bias (v  )  -  2a”  ♦  3b  +  6c  (say) 

1* 

2  2  2  2 
where  seh*  sxh  end  seJ(h  are  the  population  analogues  of  Sa^,  s^ 

and  Sa  respectively.  Using  the  result  (Wu,  1982) 
exn 


Bias(v )  »  -2a  +  b  +  0(n  ) 

L 


(4.4) 


where 


get  from  (4.3) 


.  _  _L  r  JL  s 
a  -3  L  2  S  2 
X  e  xh 


Bias  ( v.  )  «  -2a  -  2a"  +  4b  +  6c  +  0(n  )  . 

D 

In  the  special  case  of  •  2  for  all  h  ,  a"  *  0  and 


Bias  ( v.  ) 

D 


“"‘W 


Bias(vBKR_g)  «-2a+4b+6c 


“3. 


to  Oln  )  ,  »h.r.  vBTO_H  «d  vBBR.s 


(4.6) 

are  the  BRR  variance  estimators 


one 


(see  Rao  and  Wu,  1983) .  Zn  the  general  case  of  ?  2  for  at  least 
h  ,  Bias(vfa)  depends  on  the  bootstrap  sanple  sizes  {ra^}.  In  particular, 
if  m^  »  we  haw  Bias(v^)  =  -2a+4b+6c  to  0(n  3) .  The  choice 
m^  *  n^-1  (Efron,  1982)  leads  to 


Wh(n.-2)  _ 

Bias (v  )  ■  Bias ( v_ )  --rr  I  a - S  +  3b  +  6c  +  0(n  )  . 

b  L  X3  n^(n^-l)  e2xh 


(4.7) 


STRATIFIED  SIMPLE  RANDOM  SAMPLING  WITHOUT  REPLACEMENT 


All  the  previous  results  apply  to  the  case  of  stratified  simple 
random  sampling  without  replacement  by  making  a  slight  change  in  the 
definition  of  y 


fhi  * 

»hi  *  ?h  *  ”h(nh '  1,''u  -  v'lyu‘V 


(5.1) 


where  • 

to  observe  that,  even  by  choosing  -  n^-1  ,  y^  /  y*^  .  Hence  the 
naive  bootstrap  using  will  still  have  the  problem  of  giving  a  wrong 

scale  as  discussed  before.  In  the  special  case  of  n^  >2  for  all  h  , 
McCarthy  (1969)  used  a  finite  population  correction  similar  to  (5.1) 
in  the  context  of  BRR. 

Bickel  and  Freedman  (1983)  considered  a  different  bootstrap  sampling 

method  in  order  to  recover  the  finite  population  correction,  1  -  f .  , 

h 

in  the  variance  formula.  This  method  essentially  creates  populations 
consisting  of  copies  of  each  y  .  , 


n^/N^  is  the  saspling  fraction  in  stratum  h.  It  is  interesting 


i  *  1,  • .  * ,  n.  and  h  *  1 , . . . ,  L 


16- 


*  nh 

and  then  generates  {y*^}  as  a  sinple  random  sample  without  replacement 
from  the  created  population,  independently  in  each  stratum.  This 
"blow-up"  bootstrap  was  first  proposed  by  Gross  (1980)  and  also  independ¬ 
ently  by  Chao  and  Lo  (1983) .  The  variance  estimator  resulting  from 
this  method  (by  working  directly  with  y*^)  ,  however,  remains  inconsistent 
for  estimating  the  true  variance  of  6  .  It  is  possible,  however,  to 
make  the  variance  estimator  consistent  by  reducing  the  bootstrap  sample 
size  to  n^-1  ,  as  in  Section  2.  In  comparison  with  our  method,  the 
"blow-up"  bootstrap  is  somewhat  harder  to  implement,  somewhat  artificial 
and,  if  the  stratum  size,  ,  is  not  a  multiple  of  n^  ,  requires  an 
artificial  randomization  for  choosing  between  two  created  populations. 
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APPENDIX 


1.  Proof  that  v.  *  vT  +  O  (n  )  ,  assuming  that 

"  D  u  D 


max  W^/i^  *  0(n  *) ,  0  <  6^  £  m^/tn^-l)  <  $2  <  60 • 


(A.l) 


The  condition  (A.l)  allows  L  to  be  either  bounded  or  unbounded. 


Writing  y  -  y  «  Ay*  =  (Ay* .  Ay*) 1  ,  where 

P 


T 


Ay,  *  Ew  ,  (y*  —  y  ) 

j  h  Vi  h*  “3 


we  have 


L  W2 

var#(Ay*)  *  l  ~  «*.  -  Q >  (n"1) 
j  h-1  °h  hj  p 
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2  -  2 

under  the  assunption  (A.l)  and  the  boundedness  of  “  E(y^^  ”^hj^  ' 


2  -  *  _  L 

i.e.  max  S.  .  <  00  .  Hence,  Ay  *  0  (n  )  and 
*0  p 


S  -  §  *  (Ay  )Tg‘ (y)  +  j(Ay*)Tg"(y) (Ay*)  +  Opto  *) 


3 

2. 


(A. 2) 


.  T 


where  g  (y)  •  (g^y),...,  g^y))  and  g  (y)  is  the  p  *  p  matrix 


with  elements  g Cy > .  Therefore, 


-  £^(0  -  @)2  -  f  gj(y)gJt(y)E#(Ay*Ay*) 
j,k“l 

+  f  gj(?)gk£(?)E.(A?j*A?k*Ay;,  +  CMn_‘)  . 
j,k,£«l 


-2. 


Now  noting  that 


L  H 

E,(Ay*Ay*)  -  [  —  s 

j  k  h-l"h  hJk 


(A.  3) 


and 


,3.  "b_,2 


1  % 


•  -f  %t\(V1>3  *hjkt  *  yn'2) 

"h 


under  (A.l),  we  get  the  desired  result.  Here  (n^  ■  J  ^yhij”^hj*' 

<Yhik  ~  Yhk} (yhi£  *  Yh£J ’ 


2.  Proof  of  (4.2).  He  follow  the  approach  in  Appendix  4  of  Hu  (1982)  to 
derive  (4.2).  Under  (A.l)  we  have  Ax*,  Ay*  and  Ae*  •  Ay*  -  rAx*  all 


-V 


of  the  order  0^(n  ).  Hence,  noting  that 
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r*  .  ,  *  C  (1  -  £  *  (4iV>  ♦  Op(n‘2l 


we  get 


v.  -  E.  (r*  -  r) 

D 


-  x"2[e*(AS*)2  -  2E,(Ae") 


2  Ax* 


(A. 4) 


♦  3E  lt;‘f ♦  0  (n-3) 

x  P 


Now t  writing  A5*  -  where  ^  -  Vh(nh_1)  H  and 


«  ?  <4  -  ?  {y^i  -  *h  -  '<<1  -  '  we  *et  «*  followin9 


Vh  ■  i^1  ’hi 


results : 


(A.5) 


L  K  2 


E*[<Ai*)2Ax*]  -  2djjE«[eJ2(xJ  ~  \)1 


-  <  ,,  A  ^1<v '  *h> 

h-1  n^  /(n^-Dm^  n  i-1 


L  wh 

I  -5 


1  "s  ”h 

i  4“ 


h«!  «  3111 


(A. 6) 


and 


e,[<4JN2<a;*>2]  -  E,[(Ii^IJ2)  (La^l^-^l2)]  * 


hj*h'  hfQi' 


2uL  *V”"3) 

nfn 

-  (E  —  8^  )  (E  ~  8^)  + 

*h  "h  &  "h  °h  A 

m&K'K'  v]2+0p(n'3) 

-  (E  —  si  )  (E  —  ■!,)  +  2(E  i  s.  )2  +  o  (n"3) . 

eh  xh  exh  p 

Substituting  (A.  5)-(A.7)  in  (A.4)  we  get  the  desired  result. 
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